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A FORMULATION AND ANALYSIS OF COMBAT GAMES 


M. Heymann,* M. D. Ardema, and N. Rajan^ 
Ames Research Center 


SUMMARY 


Most investigations of combat problems by differential game theory have focused 
on analysis of deterministic, two-person, zero-sum, perfect-information pursuit- 
evasion games. This framework is quite suitable for situations wherein the pursuer- 
evader roles are well defined by the nature of the problem and the evader has no 
offensive capability with which to threaten the pursuer to affect the outcome. The 
formulation is, however, an inadequate one to model combat between two (or more) 
opponents both (or all) of whom have offensive capabilities and offensive objectives. 
An obvious example of such a situation is air-to-air combat. The few attempts to 
analyze this more general combat problem have used either the concept of role- 
determination or, more recently, that of two-target differential games. Neither of 
these approaches, however, has led to a complete and consistent conceptual definition 
and corresponding mathematical theory of combat differential games. It is our pur- 
pose in this paper to formulate and illustrate such a theory. 

We begin with a discussion of the qualitative features of combat games between 
two aggressive opponents; this discussion indicates the rich variety of behavior pres- 
ent in such games and makes clear the inadequacy of the pursuit-evasion assumption, 
with or without role determination, for modeling combat. We then propose a mathe- 
matical formulation of a combat game between two opponents with offensive capabili- • i 
ties and offensive objectives in a deterministic setting. Resolution of the combat 
essentially involves solving two differential games with state constraints. Depend- 
ing on the game dynamics and parameters, the combat can terminate in one of four 
ways: (1) the first player wins, (2) the second player wins, (3) a draw (neither 

wins), or (4) joint capture. In the first two cases, the optimal strategies of the 
two players are determined from suitable zero-sum games, whereas in the latter two 
the relevant game is nonzero-sum. Next, to avoid certain technical difficulties, the 
concept of a 6-combat game is introduced. 

To illustrate the definition, formulation, and solution of combat games, an 
example, called the turret game , is analyzed in detail. This game may be thought of 
as a highly simplified model of air combat, yet it is sufficiently complex to exhibit 
a rich variety of combat behavior, much of which is not found in pure pursuit-evasion 
games. 


^Senior NRC Associate on sabbatical leave from Technion, Israel Institute of 
Technology, Haifa, Israel. 

^Research Associate, Aeronautics and Astronautics Department, Stanford Univer- 
sity, Stanford, California. 


1 . INTRODUCTION 


By a game of combat we intuitively refer to an encounter between two hostile 
adversaries, or players , each of whom wishes to destroy or capture the other, while, 
if possible, ensuring his own survival. A player who succeeds in capturing his 
opponent is said to win the game. If a player is unable to win, he will attempt to 
prevent his opponent from winning and, if he is unable to do so, he will try to make 
his opponent's win as difficult or as costly as possible. 

An application that immediately comes to mind and that was one of the main 
incentives for investigating games of combat, is the aerial combat problem in which 
there is a duel between two (or even more) maneuvering aircraft. Various situations 
can be visualized; for example, a missile in pursuit of a plane, a fighter in pursuit 
of a bomber, a duel between two fighter aircraft. 

In the simplest manifestation of a combat game, one of the players has no offen- 
sive capabilities so that he can never win in the above sense. Thus, the offensive 
player becomes the pursuer , and the inoffensive opponent becomes the evader . The 
resultant pursuit-evasion problem becomes what is sometimes called a game of survival 
(see e.g., ref. 1). The evader attempts to prevent capture or, if this is not 
possible, to maximize the pursuer's cost of attaining his goal; the pursuer endeavors 
to capture at minimum cost. The special (but important) case in which the cost func- 
tional is the time to capture, has sometimes been referred to as a game of pursuit- 
evasion (ref. 1). 

The early studies of combat problems focused almost entirely on the above men- 
tioned framework in which the two players have the clearly defined and opposing roles 
of pursuer and evader, or minimizer and maximizer. There are immediate applications 
of those studies in such problems as missile versus aircraft or fighter versus 
bomber. 


The generally accepted mathematical framework for formulating and solving 
pursuit-evasion problems is the theory of differential games due to Isaacs (ref. 2). 
Specifically, the game consists of a dynamical system whose state transition is 
governed by a set of n ordinary differential equations 

— = f(t,x,u,v) , (1) 

where x = x(t) £ is the state, u = u(t) 6 UC and v = v(t) 6 V C are 

the two players' controls, and x(t Q ) = x 0 is the initial state. There is asso- 
ciated with the game a cost functional 


J(u,v,x o ,t Q ) = g[x(t),t] + 


h[t ,x(t) , u ( t ) ,v(t) ]dt 




where t (<<») is a free or fixed termination time. Typically, player u attempts to 
minimize the cost J, whereas v tries to maximize it. Various additional assump- 
tions and constraints can be imposed on the problem to suit specific requirements. 
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In games of survival and of pursuit-evasion the termination time is free and is 
determined by a capture condition ; that is , a state constraint that is imposed on the 
problems as follows: 

x(t) ^ & C V t < t < t (3) 

and 

x(t) esr. (4) 

Here & is the target or capture set and t, defined through equations (3) and (4), 
is the capture time. The game of pursuit-evasion is thus the special case of the 
above framework in which g - 0 and h = 1 with the cost being the time to cap- 
ture (t - t Q ) . It is then natural to refer to the minimizing player u as the 
pursuer and to the maximizing player v as the evader. 

Much research has been done on games of survival and games of pursuit-evasion 
within the general framework of differential game theory (see, e. g. , refs. 1, 3-5). 
The early use of differential games in the modeling and analysis of aerial combat 
problems is reviewed in reference 6. Widely investigated models, although highly 
idealized, for pursuit-evasion analysis are the homicidal chauffeur game (refs. 2, 

7, 8), and its generalization, the game of two cars. The latter was used as a model 
for aerial combat analysis in a variety of studies; see, for example, references 9-14 
as well as the general survey article reference 15. More recently, various general- 
izations of the game of two cars were used for analysis of aerial pursuit-evasion to 
accommodate variable speed and other aircraft capabilities (refs. 16-19). Also, 
special techniques were developed and examined to facilitate computation and to 
alleviate some of the difficulties associated with high dimensionality. 

It was realized even in the early stages (ref. 6), however, that the pursuer- 
evader model is inadequate for situations such as fighter versus fighter combat, in 
which there is no justification for an arbitrary a priori role assignment of pursuer 
and evader. This difficulty led to much confusion in efforts to reconcile the dif- 
ferential game methodology with intuition, based on the perceived experiences in 
actual combat situations. For example, it was stated in reference 6 that a multiple 
criterion may be required to formulate these aerial combat encounters correctly and 
that role reversals during a given encounter might (and should) occur. In other 
studies, the idea was promoted that the central issue is that of role determination 
(refs. 12, 20). Basically, the method that was proposed for resolution of the role- 
assignment problem was to examine two pursuit-evasion problems — one with player A 
as pursuer and player B as evader and the other with the roles reversed. From the 
outcomes of the two games an assignment of roles should then result. 

Realizing that the pursuit-evasion framework is unsuitable for the analysis of 
combat problems, Pachter and Getz (refs. 21, 22) introduced (following the approach 
proposed by Blaquiere et al. (ref. 23) and Getz and Leitmann (ref. 24)) the concept 
of a two-target game . In this setting each player has a target-set and attempts to 
drive the state into his own target set without being first driven to the target set 
of his opponent. Pachter and Getz confined their attention exclusively to the prob- 
lem of capturability in a two-target environment, that is, to the "game of kind" (in 
Isaacs 1 terminology). This analysis is based on Isaacs 1 (controversial) technique 
of employing barrier and other "singular" surfaces (ref. 2) to partition the state 
space into regions where one or the other player can win the game. Although this 
approach appears to be plausible in very low dimensional problems, it becomes rapidly 
infeasible as the dimensionality increases, especially in the two-target case where 
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added complexity arises. Indeed, Pachter and Getz made some major simplifications in 
their target-set geometries to overcome essential difficulties with dimensionality 
and render the analysis tractable in their investigations of the two-target homicidal 
chauffeur game (ref. 21) and the two-target game of two cars (ref. 22). A further 
recent study that combines the role assignment point of view with the two-target idea 
has been reported in reference 25. 

Although the intuitive two-target idea is definitely a convincing conceptual 
setup for the combat model, the game formulation problem and the problem of strategy 
determination and analysis have not been addressed in the literature to date, appar- 
ently because of the problem T s perceived complexity. Thus, such generally erroneous 
ideas as role reversal and role assignment still prevail in the current literature 
and the players 1 actual optimal (or, at least, winning) strategies in combat situa- 
tions remain obscure and unexplored. In the present paper we examine the combat 
problem from a strategy-analysis point of view. 


2 . QUALITATIVE FEATURES OF COMBAT GAMES 


In an ordinary pursuit-evasion problem, the termination of the game (i.e., cap- 
ture) is determined, as we have seen earlier, by the capture condition specified by 
equations (3) and (4) (which can also be stated in terms of equality or inequality 
constraints). The capturability issue constitutes the "game of kind” (ref. 2), and 
for a given initial state the problem is whether the pursuer can actually force cap- 
ture or termination. The question of strategy, that is, of how to accomplish capture 
when possible or how to prevent it (or delay it) constitutes the game of degree 
(ref. 2). 

In the two-target combat model, with targets, say, and ^ v , the termination 
condition is 

x(t) V t < t < t , (5) 

U V o 


and 

x(t) € 9 , (6) 

where is the target associated with player u, and is the target associated 

with v. If x(t) G <^ u but x(t) $ for all t Q < t < t we say that player u 

wins the game, whereas if x(t) G and x(t) $ for to < t'.< t we say that v 

wins. If n 7^ <j> and 

x(t) e n ^ , (7) 

we say that the game ends in joint capture . Finally, if x(t) ^ & for all 
t 0 < t < 00 we say that the game ends in a tie or draw . 

In the pursuit-evasion game, the pursuing player wishes to lead the game to ter- 
mination and, if he can, do so as quickly as possible. The evading player attempts 
to prevent termination or to delay it if prevention is impossible. In contrast, in 
the two-target problem the players’ objectives are more complicated. In principle, 
both want to terminate the game but in different parts of Player u wants to 
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terminate in 9'/9’ v (i.e., in ff excluding ^ v ) , and player v wants to terminate in 

To see how these conflicting objectives affect the players’ strategies and to 
gain some insight into the actual situation, let us examine, qualitatively, a number 
of possible cases. Suppose for a moment that the capability of each player to evade 
his opponent’s target is independent of, and decoupled from, his capability to pursue 
his opponent. (Air combat between two aircraft with actively guided air-to-air 
missiles is an example.) Each player would then play two simultaneous and indepen- 
dent pursuit-evasion games: one is an offensive game in which he would try to cap- 

ture his opponent, the other a defensive game in which he would try to evade his 
opponent’s weapons. The pursuit-evasion game that terminates first would determine 
the winner. 

Suppose now that with each of the above mentioned pursuit games we associate a 
cost functional 


J i = g i [xCti) * t ± ] + J \[t ,x(t) ,u(t) ,v(t) ]dt i = u,v , 

t o 

where t-j_ is the termination time of game i. In game u, player u wishes to 
minimize, and player v wishes to maximize J u ; in game v, player v is the mini- 
mizer and u the maximizer. Suppose that if the competition between the two games 
is ignored, it turns out that t u is greater than t v as optimal termination times. 
By this optimality criterion it would be concluded, quite possibly erroneously, that 
player v is the winner of the combat. Thus, even in this elementary example, we 
are forced to add a constraint to each of the two games to account for the existence 
of the other, that is, for i = u,v,x(t)^ for all t G < t < t^ (where ^ 

denotes the target set of the game i) . This constraint introduces a coupling 
between the two competing games that affects the players’ strategies. A particularly 
interesting and important cost criterion is obtained when gj_ = 0 and h^ = 1, 
i = u,v, that is, when both games assume a time-optimality criterion. Let J* = t^ , 
i = u,v be the optimal times obtained in the two pursuit games where, in each game, 
the pursuer is the minimizer and the evader is the maximizer. Clearly, if t u < t v 
then in game v the constraint x(t) ^ for all t G < t < t v is violated and 

the constrained game v has no feasible solutions. Obviously u is the winning 
player. 

Now, in the more general case, the players cannot perform their evasive maneu- 
vers, that is, they cannot stay out of range of their opponents’ weapon envelopes 
(capture sets), independently of their offensive maneuvers to capture the opponent. 
Indeed, there is, typically, a trade-off between the two objectives of survival and 
capture of the opponent, and the players have to play their strategies accordingly. 

To illustrate the situations that might occur, let us consider two vehicles 
maneuvering in a horizontal plane (fig. 1). The arrows describe the vehicles’ instan- 
taneous heading, the cones the instantaneous envelopes of their weapons (fixed with 
respect to their headings) and the vertices of the cones are their instantaneous 
positions. We assume the typical situation that each player’s maximum turn rate and 
speed are mutually dependent, specifically that the faster they move the slower they 
are able to turn, and conversely. 
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Figure 1.- Maneuvering vehicles* 

Suppose that player v initially is in a vulnerable position (see fig. 1) such 
that by a slight turn of player u, v might enter u's weapon envelope, and further 
suppose that v is more maneuverable in terms of turn rate and speed than u. If v 
adopts a pure evasive maneuver and u pursues, two outcomes are possible: either v 

gets captured quickly or he evades successfully. If he can avoid capture initially, 
then presumably he can capture u in due time by virtue of his superior maneuvera- 
bility. In the other case, even though v cannot avoid initial capture by u if he 
adopts pure (inoffensive) evasion, he may still be able to win the game by performing 
the offensive maneuver of turning his own target at u, thereby capturing u before 
u captures him. More generally, v may have to perform a composite (offensive- 
defensive) maneuver wherein he turns his target at u while moving away from u T s 
target just enough to avoid capture himself. Thus, player v might be able to win 
the game by a composite strategy even though he would lose it by playing pure evasion. 

It is also of interest to examine the optimal play of player u under the 
assumption that v can win with an optimal composite strategy. In spite of his 
eventual capture, u should adopt an offensive behavior since by turning his weapon 
at v he forces v to slow down v f s offensive move in order to survive u T s 
threat, thereby at least delaying u's capture. In this situation, roles of pursuer 
and evader cannot usefully be assigned to the two players. 

Thus, it is easy to understand that in general, analysis of pure pursuit-evasion 
problems, with or without role-determination analyses, reveals little, if any, infor- 
mation about the possible outcomes and optimal strategies of a combat game. In fact, 
it may be expected that misleading conclusions will be frequently drawn. It is clear 
that a new and fundamentally different approach to the problem is required. 
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3. FORMULATION OF A COMBAT GAME 


Consider a system described by a set of n ordinary differential equations, 

•— = f ( t , x, u, v) , (8) 

with initial time t 0 and initial state x(t 0 ) : = x 0 . The controls of the two players 
are measurable functions taking values in compact subsets U C and V C 

respectively. 

Associated with the combat problem are two subsets and # v in $? n+1 , the 
(extended) targets or (extended) capture sets of the players. We shall assume that 
& u and v are the closures of some open subsets in ^ n+1 and that there exists a 
time T* > t Q such that for all x G and all t > T* , (t,x) G O S' v . 

Combat starts at t = t 0 and continues as long as 

[ t , x ( t ) ] g int S' 

where ST: = # u U is the combat T s (extended) terminal set and where int( # ) denotes 
interior. We shall say that the combat terminates at time t where 

t: = inf{t > t | [t,x(t)j G int &} . (9) 

If t = T*, we say that the combat ends in a draw . If t < T*, we say that player u 

(respectively, player v) wins the combat if there exists an e > 0 such that 

[t,x(t)] G int (respectively, [t,x(t)j G int S v ) for all t > t satisfying 
t - t < e. If both players win the combat we speak of joint , or simultaneous , 
capture . Thus, the combat can terminate in one of the following four ways: 

(1) player u wins; (2) player v wins; (3) a draw; and (4) joint capture. 

To obtain a consistent formulation of the combat problem, it is necessary first 
to resolve its decidability question. That is, each initial event (t 0 , x Q ) must be 
uniquely and unambiguously classifiable into one of the four termination cate- 
gories (l)-(4) above, thus partitioning the event space into mutually exclu- 

sive regions $ u , $ v , $ u v v > and ^uAv> respectively. 

To this end we define the players 1 termination preferences as follows. Player u 
ranks his preferences in order of priority as 1, 3, 4, 2, and player v ranks his 
preferences as 2, 3, 4, and 1. 

Remark 1- This ranking is consistent with the intuitive notion that each player 
wishes to capture his opponent while not being captured himself. It also resolves 
the ambiguity that might occur in deciding between outcomes (3) and (4) when (1) 
and (2) cannot be forced by either of the players. This last point becomes clear if 
we observe that outcomes (3) and (4) can occur in one of two essential ways: the 

players may be "locked into joint capture" in the sense that a unilateral attempt by 
one of the players to postpone termination will enable his opponent to win; on the 
other hand, if a player cannot force a win but has control over the time at which 
joint capture will occur, he will select the latest such time and, if possible, set 
it at T* (i.e., a draw). □ 
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It is readily noted that by definition, the regions <f> u , <|> v . <f) u y v , anc * ^uAv are 
invariant in the sense that there exist strategies for the players that maintain the 
resultant trajectory in its initial region until combat termination. Moreover, any 
sensible or, as we shall say, consistent strategies by the players will satisfy this 
invariance. Indeed, a trajectory will leave its initial region only if at least one 
of the players makes a fatal strategy error, in which case we say that the game 
strategies are inconsistent. 

Remark 2 - It is important to emphasize that in properly formulated and correctly 
played combat, the winning capabilities of the players depend only on the problem 
data (including the initial state) . No reversals of the winning capability (or 
"role") occur unless a fundamental error has been made by a player who relinquishes 
an advantage to his opponent. □ 

We now associate with the combat problem a pair of differential games, one from 
the point of view of player u, or the u-game , and one from the point of view of 
player v, or the v-game . 

Th e u-game G u is defined as follows. Given is a cost functional 

- - A 

J u = 6 u Ix( t u ) » fc u ] + J h u [t,x(t),u(t) ,v(t) ]dt , (10) 

*o 

with player u defined as the minimizer and player v as the maximizer. The ter- 
minal time is specified by 


t u = inf{t > tj [t,x(t>] e int S , 

subject to the event constraint 

[t,x(t) ] ^ int 9" V t < t < t . 

v o u 


The v-game G v is specified as follows. 


Given is a cost functional 


J 

v 


g^xCty),^] + 


A 

J h v [t,x(t) ,u(t) ,v(t) ]dt , 


( 11 ) 


( 12 ) 


(13) 


with player v the minimizer and player u the maximizer. The terminal time t v 
is given by 

t = inf{t > t |[t,x(t)] £ int & } , (14) 

v o 1 v 

subject to the event constraint 

[t,x(t) ] int V t < t < t v . (15) 


We examine now the role of the two differential games in the formulation of the 
combat problem. First, note that if (t 0 ,x Q ) E $ u U $ v exactly one of the games has 
feasible solutions (satisfying the terminal condition and event constraint) so that 
only the game of the winning player (the one with feasible solutions) can be played. 
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The winning player will then choose his strategy to minimize his own cost func- 
tional (subject to the terminal and event constraints of the game); his opponent, 
realizing that he has no alternatives (having no feasible solutions to his own game), 
will play to maximize his opponent’s cost. Thus, there results a zero-sum game 
with the winning player the minimi zer and his opponent the maximizer. 

In case (t 0 ,x Q ) E <j> u w ^ ^uAv 5 both games have feasible strategies, and the ter- 
minal times t u and t v coincide (to the least time in which each player can force 
capture of his opponent or a draw; see also remark 1). In this case each player will 
choose to minimize his own cost functional (while ignoring his opponent’s). The 
resultant game is a nonzero-sum game with event and terminal constraints. 

In summary, each player will choose his strategy to minimize the cost func- 
tional of his own game unless for the given initial conditions his game has no feasi- 
ble solution (that is, he is forced to lose the combat) in which case he will choose 
his strategy to maximize the cost of his opponent. 

Definition 1 - A combat problem formulated with the aid of dual differential 
games and with strategy selections as described in table 1 is called a combat game . □ 


TABLE 1.- STRATEGY SELECTION RULES 
IN COMBAT GAME 


Region 

<t> 

T u 

T v 

*uVv U ^uAv 

Strategy of 

u 

min J 

u 

max J v 

min J 

u 

Strategy of 

V 

max J 

u 

min J 

V 

min J 

V 


Remark 3 - Within the dual differential games framework proposed in the present 
paper, rules for strategy selections other than the one described above may be chosen. 
For example, we might have decided to select the players 1 strategies to maximize the 
opponents’ costs instead of minimizing their own when (t Q ,x 0 ) E ^ u Vv u ^uAv* 

Although this is logically consistent, we prefer the setup as proposed above since we 
find it more in line with expected intuitive response. From a purely mathematical 
standpoint it makes no essential difference what strategy selection rule is chosen so 
long as it is consistent and decidable. □ 

In games of combat in general, the terminal time is strongly influenced by the 
competing nature of the two differential games G u and G v . When (t 0 ,x 0 ) E <f> u Av> the 
terminal time is forced to be the least time in which the players can, respectively, 
secure termination of their game, and the times for the two games coincide. Conse- 
quently, a case of special interest and simplicity in terms of strategy selection, 
and one that is also of practical importance, is when g u = g v = 0 and h u = h v = 1, 
that is, the cost functionals of both games are the durations of the games. Thus, 
in the time-optimal case, for (t 0 ,x 0 ) G <j) u v v U ^uAv essentially every feasible 
strategy is optimal. 
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4. 6 -COMBAT GAMES 

With certain types of cost functionals, the combat game as formulated in 
section 3 may not have optimal strategies because of lack of closure of the space of 
admissible trajectories. Specifically, since the target sets and are closed, 

so is their (nonempty) intersection # u D Hence, the set # u *: = # u / 

the complement of # u H <jT v in <^ u , and the set (similarly defined), are not 

closed. As a result, a convergent sequence of winning trajectories for one of the 
players, say for player u, that terminate in # u *, need not converge to a trajectory 
that terminates at a point in # u * but, rather, in n ^ v# But, then, the limits 
ing trajectory ends in joint capture and is not winning for u. 

We now turn to reformulate the combat game to avoid the technical difficulties 
referred to above. To this end we introduce the concept of a 6-safety margin and a 
6-combat problem . 

Let 6 be a positive number and let ^ u a v (< 5) : “ <^ v ) denote the relative 
open 6-neighborhood of ^ that is, the set of all points cf) : = (t,x)e & 
whose (Euclidean) distance d (<\> ,^ u A #* v ) from H & v is less_than 6. Let 
^ u (6) : = ^ u /^uAv( 6) denote the points of that are not in ^ u Av^)* Similarly 

define ^ v (6) . 

The 6-combat initiates at time t = t Q and, just as before, continues as long 
as 

[t ,x(t) ] £ int f , 

with termination time t defined by equation (9). The winning conditions of the 
6-combat differ, however, from those of the "ordinary" combat as follows. 

We still say that the 6 -combat ends in a draw if t = T*. However, if t < T* 
we designate the following outcomes. We say that u wins if 

[t,x(t)j € # u (6) , (16) 

that v wins if 

[t ,x(t) ] € ^ v (6) , (17) 

and that the combat ends in joint 6-capture if 

[t,x(t) ] e ^ u a v (6) . (18) 

The 6-combat, just as ordinary combat, can end (1) by u winning, (2) by v 
winning, (3) in a draw, or (4) in joint 6-capture. The players/ termination prefer- 
ences are unchanged as before, and again the event space can be partitioned into 
regions according to the outcome of the 6-combat as <f) u (6), <J> V (6 ) , <f> u v v (d)> and 
^uAv^)’ respectively. 

The fundamental difference between the ordinary combat problem and the 6-combat 
problem is that in the latter case the winning terminal sets «^ u (6) and ^ v (6) are 
closed and that ^ u (6) H ^ v (6) = (j) so that u winning and v winning cannot occur 
simultaneously. Thus, in formulating the 6-combat game we have to proceed differ- 
ently than before. 
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First, we associate with the 6-combat problem a pair of cost functionals: 
J u (6), the cost from the point of view of player u: 


u 


t 


J (<5): » g [x(t),t] + I h [t,x(t),u(t),v(t)]dt , 


( 19 ) 


and J v (6) , the cost from the point of view of v: 


v 


f 


J(6): = g [x(t),t] + I h[t,x(t) ,u(t),v(t)]dt . 


v 


( 20 ) 


Each player will relate to one of the two cost functionals depending on the 
expected outcome of the game as follows* If (t 0 ,x 0 ) G <j> u (6) so that player u can 
force the game to terminate with condition (16) holding, both players will relate to 
cost functional J u (6), with u minimizing and v maximizing. Thus, a zero-sum 
differential game results with cost functional J u (6) and terminal constraints (9) 
and (16) holding. 

When (t 0 ,x Q ) G <J> V (6), player v can force terminal condition (17), and in the 
resultant zero-sum differential game player v minimizes and player u maximizes 
the cost J v (6) subject to conditions (9) and (17). 

Finally, if (t 0 ,x 0 ) e <f) u y v (<$) U ^ U A V (^) a nonzero-sum game will be played with 
each player minimizing his own cost functional with the terminal conditions (9) 
and (18) holding. 

Remark 4 - An example of cost functionals for which the ordinary combat game 
formulation as described in section 3 is valid and for which a 6-game formulation is 
not necessary is the following pair: 

J u ■ (-)d[(t u ,x(t u )),# v ] 

J v - (-)d[(t v>lt (t v )),» 1 . 


Here the cost is the negative of the (Euclidean) distance of the terminal event 
from the opponent’s target. Since the winning player is the minimizer, he maximizes 
this distance, whence his optimal strategy will never lead to zero cost (unless, of 
course, joint capture is the only possible outcome).. □ 

Remark 5- The terminal set ^uAv(^) not closed so that the nonzero-sum game 
for (t 0 ,x 0 ) G cj> u v v (6) U <f> u A v (S) ma Y sometimes not have optimal strategies. However, 
since it is impossible to secure closure of all three terminal sets # u (6) , ^ v (6), 
and^ u yy v (6) sinultaneously , it appears preferable to obtain a complete resolution of 
the winning games. □ 

Remark 6 - The parameter 6 can be chosen arbitrarily small to obtain an arbi- 
trarily close approximation of the ordinary combat situation but with optimal winning 
strategies existing. □ 
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5. THE TURRET GAME 


Formulation of the Game 

To illustrate the theory developed above we consider a combat game that repre- 
sents a simplified version of the air combat situation discussed qualitatively in 
section 2. Player u moves in a plane with arbitrary velocity relative to a fixed 
reference frame (X,Y) and can turn a ray weapon relative to a fixed direction at a 
bounded angular rate a (see fig. 2). Player v moves such that he is always at a 
distance R from u, and he can traverse this circle at an angular speed relative to 
a fixed direction at a bounded rate 3. Player v also has a ray weapon that he can 
turn relative to the line of sight between the two players at a bounded rate <j> . 

For convenience, we represent the problem in a relative reference frame with 
origin at u's position and the, y-axis along, u's weapon (fig. 3). Letting 
Xi = 3 - cx, x 2 = <f> , u = a, v x = 3, and v 2 = ~<j> the kinematical equations of motion 
are 

*i - Vj - u , x x (0) = x° , (21) 

X 2 = -V 2 , X 2 (0) = x° , (22) 

with x x and x 2 being computed modulo 2 tt, and where we take t Q = 0 since the sys- 

tem is autonomous. In view of the circular symmetry of the problem, it is easily 
seen that the playing space of interest is 

P = {(x 1 ,x 2 )|x 1 € [0,7t],x 2 G [ 0 , it ] } . (23) 



Figure 2.- Turret game in fixed reference frame. 
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Figure 3. - Turret game in relative ref erence frame. 

The controls are specified within the bounds 

0 < u < u (24) 

and 

( Vl iv 2 ) € VC#J', (25) 

where is the positive quadrant of gR 1 . 

Next, we choose T*, the maximum allowed time duration of the combat. The 
extended targets are then given by 

= { (t, Xl ,x 2 ) e*x p| Xi < e x if t < T*} 

and 

= { (t ,x 1 ,x 2 ) E 91 x P | x 2 < e 2 if t < T*} , 

where e x R and e^R are the radii of the vulnerability regions or capture sets of the 
two players (fig, 3). The extended joint capture region gT^ H is given by 

H §T “ {(t,x x ,x 2 ) p|x 1 < e p x 2 < e 2 if t < T*} , 

In the ensuing discussion we shall for the most part (except when dealing with draw 
regions and associated strategies) assume that T* is sufficiently large so that we 


13 



can ignore the t dimension of the target. Thus we shall make reference to the 
(ordinary) target sets (see fig. 4) 

= {(x*l,X 2 ) G P|x x < £-^} 9 

= {(x 1 ,x 2 ) G p|x 2 < e 2 } , 

^ nr v = {(XpX^ 6 p|x], < Ej.Xj < e 2 } . 


x 2 



Figure 4.- Ordinary target sets. 

Upon specifying the safety margin 6 > 0, we obtain the 6-capture sets as 
depicted in figure 5. The 6-combat terminates at time t < T* if t is the first 
time the state intercepts the set <^ u (6) U ^ v (6) U with an inward velocity. 

Alternatively, combat terminates at T* in a draw. 6-capture of v by u occurs 
(if it occurs at all) if at termination with t < T*, 

x 1 (t) = e l9 x 2 (t) > e 2 + 6 and x r (t) < 0 . (26) 

Similarly, 6-capture of u by v occurs if at termination with t < T*, 

x x (t) > e x + 6 , x 2 (t) = e 2 and x 2 (t ) < 0 . (27) 

Next we will analyze this game for different cost functionals and different 
cases of the control set V. 
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Figure 5.- 6-capture sets. 


Linear Control Constraint 
Let V (fig. 6) be given by 

V x > 0 , v 2 >0, + (28) 

V], v 2 

where v x and v 2 are preselected positive bounds. (This case may be viewed as a 
convexification and approximation of the typical situation in which v ! s motion is 
limited by a lateral acceleration constraint specified by v 1 v 2 < k x and by 
bounds v 1 < k 2 and v 2 < k 3 , dashed lines on figure 6.) 

The cost functionals are chosen as 



and 
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Figure 6 Linear control constraint. 

where C 1 and C 2 are positive constants. These cost functionals reflect the com- 
bined (weighted) objective of the winning player of minimizing the termination time 
while securing maximum safety, that is, maximum final distance from the opponent’s 
target set, and conversely for the losing player. 

Before beginning the detailed analysis of optimal strategies, we examine the 
implications of the termination conditions (26) and (27). First, note that v can 
always win from suitable initial conditions because, from equations (22) and (28), he 
can always satisfy the third condition of equation (27). On the other hand, for u 
to win he must be able to force the third condition of equation (26), x x (t) < 0; this 
implies, using (21), (24), and (28), that 

u > v 1 (31) 

must hold. Thus, the relative magnitudes of u and v 1 are of key importance and we 
begin by studying the game with (31) holding. 

First, consider u’s winning 6-game. From table 1, in this game u wishes to 
minimize and v to maximize J u (6) in equation (28). In order to employ the stan- 
dard necessary conditions, the Hamiltonian is defined by 

H = A o C 2 + X 1 (v 1 - u) - X 2 v 2 . (32) 

We may set A 0 = 1 (since X Q = 0 adds no new candidates for optimal control) and 
A-l and A 2 are constants (because H does not depend on x x or x 2 ) . If u*, v^ f , and 
vj are optimal controls, then 
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( 33 ) 


and 


u*,v*,v* = arg( min max H) , 
0<u<u v x ,v 2 SV 


H(X 1 ,X 2 ,u*,v*,v*) = 0 . 

The termination condition is 

x x (T) = e x . 

and the state constraint is 

x 2 (t) > e 2 + 6 , V t e [0,t] . 


(34) 


(35) 


(36) 


From equation (29) , the transversality conditions give X 2 as 

X 2 = X 2 (t) = 3J u (6)/9x 2 = -2C 1 x 2 (t) < 0 . (37) 

Therefore, we may write equation (32) as 

H = C 2 + X 1 (v 1 - u) + 2C 1 x 2 v 2 , (38) 

where x 2 = x 2 (t) . 

To determine the optimal controls from equations (33) and (34), the sign of X x 
is needed. If X x < 0, then from equations (33) and (38) = 0, = v 2 , and 

u* = 0; therefore, H is the sum of two positive terms, violating condition (34). 
Similarly, X 1 = 0 leads to violation of (34) and thus 

X x > 0 . (39) 


It follows that the optimal control for u is 


u* = u 


(40) 


and the optimal controls for v are 

v*,v* = arg[ max (X x v x + 2C 1 x 2 v 2 )] . (41) 

v i» v 2 eV 

The optimal solution of this simple linear programming problem will always lie on the 
constraint ’■ 

v 2 = " fj- v i + v 2 0 < v a < v 1 (42) 

so that equation (41) becomes 


v* = arg[ max (Xj^ - 2C 1 x 2 v 2 /v 1 )v 1 ] . (43) 

0<v 1 <v 1 
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There are three possibilities: 


h > 2 C i x 2 v 2 /v i ■* v ? " v i ■ » v* = 0 , 

< 20^2^2/^! => v* = 0 , v* =■ , 

X 1 = 2C 1 x 2 v 2 /v 1 => v*,v^ singular . 


Next, consider v’s winiling 6-game with condition (31) holding. Now, u 
wishes to maximize and v to minimize equation (30) subject to 


x 2 (t) 

= e 2 

(45) 

and 



x i (O > + 6 

v t e [o,t] . 

(46) 

Proceeding as before, we conclude that X x 
possible optimal controls are as given by 

= u* = u, X 2 > 0» and v's 

(44) ; however, in this case only the 

choice 


(44b) satisfies (34). 

We are now in a position to partition the game’s playing space (eq. (23)) into 
the regions $ u (6) (u wins), $ v (6) (v wins), $ u a v (^) (joint 6-capture), and 
$uVv(S) (draw). Player u's winning 6-game will be feasible if and only if he is 
able to satisfy equations (35) and (36) for all of player v ? s admissible controls. 
At the boundary of $ u (6), v will just be able to make (36) an equality with con- 
trols (44b). With this choice of controls, (35) and (36) give 



(44a) 

(44b) 

(44c) 


that is. 


x° - e - 6 v 9 
22 2 



which defines $ u (6). 


(47) 


Similarly, v T s winning 6-game will be feasible if and only if (45) and (47) are 
satisfied. At the boundary of $ v (6), v can just achieve (45) with (46) an equality 
at t = t and his controls thus will be (44b) here. Integrating (21) and (22) with 
this control choice and using (46), the specification of the region $ v (6) is obtained 
as 



(48) 


Since under condition (31) there can never be a draw (u will always eventually 
win with u* = u unless v does so first), the region of the playing space not 
satisfying either (47) or (48) is a joint 6-capture region, $ u Av(^)> where the opti- 
mal strategies are (40) and (44b). The curves defined by equalities in (47) and (48) 
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divide the playing space into regions of different outcomes and, therefore, following 
Isaacs (ref. 2), may be termed 6-barriers. 

There remains the problem of determining v’s optimal controls in region $ u (6). 
We accomplish this by direct computation of J u (6) in equation (29) for the three 
control possibilities (44) and comparison of the results. For (44a) and (44b), we 
have 


= -C x x 2 2 + C 2 (x° - e x ) / (u - v 1 ) (49) 

and 

J ^ b) (6) = -C x [x 2 - y 2 (x x - e x )] 2 + C 2 (xJ - e x )/u . (50) 

The optimal controls for v between the choices (44a) and (44b) will be (44a) when 
j( a >(6) > 4 b >(6), giving 


Vi 

t - v x > C lY 2 t 2x 2 ° - y.2 ^ X 1 

(where y : = v x /u), and they will be (44b) when this inequality is reversed. 

Next consider the surface (line) separating these two regions, defined by 
J^ a /(6) = J^ b ^(6). Solving for x 2 , 


(51) 


x 2 =y 


K - e l > 


C V 
2 ' 1 


2C 1 y 2 u( 1 - y x ) 


(52) 


The slope of this line is Y 2 /2, or, from (47), one-half the slope of the boundary 
line of 3> u (6). The intercept of (52) with u’s target set is 




C y 
2 ' 1 


2C lY2 u(l - y ) * 


(53) 


Now consider the singular case (44c). Substituting (40), (42), and the condition 
(44c) into (38) and invoking (34) gives, upon solving for x 2 , 


C 2^1 


X 2 = 


2C 1 y 2 u( 1 - Y x ) 


This agrees with (53) and thus (52) is a singular surface (44c) with termination con- 
dition (53). The controls on this surface are found by setting dx 2 /dx 1 = Y 2/2 
and using (21) and (22) and invoking (42): 



dx i v i - u 


( v 2 /v i) v i " 



v. 


y_2_ 

2 * 
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u 


which gives 




v 


1 - y 
2 2 - Y 


rj 



( 54 ) 


It remains to test the optimality of these controls. A straightforward computation 
of the costs using controls (44a), (44b), and (54) shows that 

J (a) (6) = J (b) (S) < J (c) (6) 
u u u 

for any initial state on the locus (52); therefore, the controls (54) are optimal for 
v in this case. 

We can now completely specify the optimal strategies (controls) for the two 
players for case (31). In the region ^^(d) (see fig. 7(a)), v plays (44b) until 
termination. In $^ 2 ^(6), v plays (44b) until the surface (52) is reached and then 

plays (54). In $^ 3 ^(6), v plays (44a) until (52) is reached and then plays (54), 
and, in (6), v plays (44a) until termination. The region <£> u (6) is given by 

$ u (6) = U $,^(6). In the regions ^ 1 ^(6) and $ u a v (<$), v plays (44b). Player u 
i=i 

plays u in all regions. Note that in figure 7(a) the partition of the playing 
space is described for the case 

C Y 

cs 2 r 1 

e 0 + 6 < ^ — - "7i r < 7T . 

2 2C lY2 u(l - Yl ) 

This figure also shows example optimal trajectories in each region. 

Next, we consider the case 

u = v x . (55) 

In this case, v can always prevent the third condition of (26) from being satisfied 
and thus u can never win (from initial states outside the target's interior). 
Therefore, only v ! s winning game need be considered. Reasoning exactly as before, 
the region $ 1 ^ 1 ^(6) is defined by (48) and the optimal controls in this region are 
(40) and (44b) . 

In the region of the playing space defined by (47), a simple calculation using 
(21), (22), (42), and (55) shows that selection of controls (44a) results in a draw, 
whereas selection by v of any other controls subject to (42) gives a trajectory 
with slope y 2 , resulting in capture of v by u; therefore, v T s optimal controls 
are (44a) in this region, denoted ^uVv(^) * -*- n region not satisfying either (47) 
or (48) , the controls (44a) again give a draw, whereas any other controls for v 
result in joint 6-capture. According to the hierarchy adopted in section 3, v 
prefers a draw to joint 6-capture and therefore v's optimal controls in this 
region, denoted $^y^(6), are also (44a). The region $ uVv (6) = $^^(6) U $^^(6) 
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(c) Y 1 > 1. 
Figure 7.- Concluded. 
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is then given by the converse of ■ (48) . and all trajectories in this region are sta- 
tionary with t = T*. The regions (6) and # uVv (6) for (55) holding are shown in 

figure 7(b), along with example optimal trajectories. 

The last case to consider is 

u < v 1 . (56) 

From (21), (22), (26), and (27), it is obvious that now v can capture u from any 
position in the playing space P that is not in the interior of u’s target, and 
therefore $ v (6) = P/int ^ u (6). The necessary conditions again give. (40) as u f s 
optimal control and show that v T s optimal controls will lie on (42). In this case, 
however, the necessary conditions do not establish the sign of X 2 ; further, the 
optimal controls of v are frequently nonunique, and the optimal trajectories are 
frequently on the boundaries of the playing space. This means that the necessary 
conditions are of little use in determining v f s optimal controls and we use direct 
comparison of cost functionals instead. 

The minimum time-to-capture from an arbitrary point (x^,x 2 ) in the region (48) 
is obtained from using controls (44b) in (21) and (22); the result is 

t’ = (x 2 ° - e 2 )/v 2 . 


At this time, 

x i = -< x 2 ” e 2 Vy 2 + 


and thus from (30) , 

j;<«) = -Cjxl - (x° - e 2 )/y 2 ] 2 + C 2 (x° - e 2 )/v 2 . 


At t T , v has the option of forcing penetration immediately or of playing controls 
(44a) and forcing penetration of his target at some later point x" at an additional 
time increment t". Again from (21), 

x” = (v 1 - u)t” + x[ 

and the cost is 


J" 

v 


( 6 ) 


-CiX 


»»2 

1 



x i + ( x 2 " £ 2 )/y 2 
+ — — •= — 



(57) 


But this function will have a minimum with respect to x” e [x^,tt] at either x'^ = x^ 
or x" =t t. Thus, we need to compare the cost (57) for x'^ = x* with that for 
x’^ = 7T. The result is that the minimum-time path will be optimal if 


xr + TT - 


X 2 - 


C 1 u(y 1 - 1) 


(58) 


and the path ending at (^>^ 2 ) will be optimal when inequality (58) is reversed. Note 
that because of the linearity of (21), (22), and (42) all paths using any sequence 
of controls satisfying (42) and reaching (iT,e 2 ) without violating the constraints 


23 



will take the same time, and thus the optimal controls and paths are not unique in 
this case. 

The line separating the two regions (obtained by replacing the inequality by 
equality in (58)) has slope y 2 and is thus parallel to the minimum time paths. This 
surface intercepts v's target at Cg/tC^u^ - 1)] - it. 

The regions and example trajectories are shown in figure 7(c) for the case (56) 
and. the condition e 2 + 6 < C 2 /[C 1 u(y 1 - 1)] — tt < tt. The optimal paths in region 
$’v 2 ' > ( ( S) are nonunique (only the two extreme paths are shown) and all end at 
(e 1 + 6 , e 2 ) in minimum time; the paths in $.( 2 )(<$) are Unique and minimum time; and 
the paths in $(. 3 '(6) are nonunique and all end at (it, e 2 ). If 
C 2 /[C 1 u(y 1 - 1)] - tt < e 2 + 6, all optimal trajectories end at (tt, e 2 ) and if 
C 2 /[C 1 u(y 1 - 1)] > 2 tt all trajectories are minimum time. 

A special case of these results is time-optimality (C x = 0 and C 2 = 1). In 
this case, the regions in which the optimal trajectories are not time-optimal vanish 
(specifically, $ 1 ] 1 '(6), $^ 2 '(S), $^ 3 '(6), and $^ 3 ^(6) in fig. 7). Also, in region 
(48) u's control is not defined since it has no effect on the outcome of the game. 
These results are summarized in figure 8; the optimal strategies in each region are 
obvious . 



(a) y j < 1 . 

Figure 8.- Regions and optimal trajectories in the playing space: turret game 

with linear control constraint and time optimality. 
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Circular Control Constraint 


Now let V (fig. 9) be given by 


v x > 0, v 2 > 0, Vj + v| < v 2 , (59) 

where v is a preselected positive bound, and choose both costs as time-to-capture 


J u («) - J V W) - 


r 


dt . 


(60) 


As before, inspection of (22) and (27) shows that v can always force termina- 
tion (i.e., x 2 (f ) < 0) and is therefore always capable of winning, whereas from (21), 
(26), and (59) u can win only if 

u > v . (61) 


We begin by investigating the game with this condition holding and first consider 
u’s winning game, that is, u minimizes and v maximizes (60) subject to (21), (22), 
(24), (35), (36), and (59). The Hamiltonian is 

H = 1 + A 1 (v 1 - u) - X 2 v 2 + p(v 2 - v^ - v^) , (62) 

where the (ordinary) multiplier y satisfies 



Figure 9.- Circular control constraint. 
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V - 0 

VI > o 


if 

CM H 
> 

+ v: 

if 

v 2 x 

+ v : 



and and X 2 


are constants. The optimal controls for v must satisfy 


_9H_ 

8vj, 

_9H_ 

9v 2 



( 63 ) 


(64) 


Since y = 0 violates the condition H = 0, required for optimal controls, (63) 
implies that v's optimal controls satisfy 

v^ + v 2 = v 2 . (65) 

Further, H = 0 and conditions (64) imply that 


so that 


A x > 0 and \ 2 < 0 


( 66 ) 


u* = u . 


(67) 


If (36) is satisfied with strict inequality at t = t, the transversality conditions 
give X 2 = 0, and from (64) and (65), for this case, 

v* = v, v* = 0 . (68) 


To determine the boundary of u's winning region, note that v desires to 
choose controls (subject to (65)) to make this region as small as possible. Assuming 
constant controls and using (65), (21) and (22) may be integrated to give 

e x = x x + (v x - u)t (69) 

and 


x. 


= x 2 - (v 


-2 


' 2 y 1/2 t 


Evaluating (36) at t = t and using (69) and (70) gives 


(70) 


x° - e 2 - 6 ^ (v 2 


v 2 ) 1 / 2 i 


- e. 


u “ v. 


(71) 


Therefore j u f s winning region will be smallest when v 1 = Y^u, where y = v/u; 
putting this value in (71) gives the specification of region $ u (<$) as 


x° - e 2 - 6 > (x x - £;l )/(1/y 2 - 1) 1/2 . (72) 

The optimal strategies in this region are (67) and (68). 


Next, consider ,v T s winning 6-game for (61) holding; now u maximizes and v 
minimizes (60) subject to (65) and (66). The Hamiltonian is (62) as before, but now 
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p i 0 in (63). Proceeding as before, if (46) is satisfied with strict inequality 
X 1 = 0, then v's optimal controls are 

vf = 0, v£ = v (73) 

and u' s control is indeterminate. For (46) satisfied with equality at t = t, u's 
optimal control is given by (67) and v’s optimal controls are constants satisfying 
(65). Integrating (21) and (22) with these controls gives 

t = (x° - e 2 )/(v 2 - vf) 1 / 2 

e, + 6 = (v, - G)(x 2 ° - e 2 )/(v 2 - v 2 ) 1 / 2 + x° 

Solving the latter equation for v 1 , 

- 1 - n[y 2 (1 + n 2 ) - 1] 1/2 
v x = u LiJ — * — 1 L - ' ~ i 

1 + n 2 



where 



(76) 


The region in which this control will be used is the region in which (75) will have 
real solutions satisfying (65), that is, 

(1/y 2 - l) 1 / 2 < n £ l/y • (77) 

Note that the controls corresponding to the lower and upper bounds in (77) are 
v 1 = y 2 u and v x = 0, respectively. 

We have now determined the regions and optimal controls for the case (61). In 
$ u (6), defined by (72), the optimal controls are (67) and (68). In 0^ 1 ^(6), defined 
by 

n > 1/Y , (78) 

the controls are (73) with u indeterminate. In (j)^ 2 )(6), defined by (77), the opti- 
mal controls are given by (65), (67), and (75). The region that remains, that is, 
that which satisfies 


x 2 - c 2 


x i " e i 


X 2 - £ 2 

" X 1 - E ! - « ’ 


(79) 


is a joint 6-capture region (a draw is not possible in this case) with controls (67) 
and 


v* = y z u , v* = y u (1 - Y 2 ) 1//2 


(80) 


These regions and example optimal trajectories are shown in figure 10(a), 
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(a) y < 1. 

Figure 10.- Regions and optimal trajectories in the playing space: turret game 

with circular control constraint and time— optimality . 
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Figure 10.- Concluded. 


The cases u = v and u < v may be easily inferred from the results for the 
case (61). As y -* 1 from below, the slope of the boundary line between regions 

and $ u y\v^) becomes infinite, and region $ u (6) vanishes. Further, v cannot 
capture in any finite time from points on this boundary and, therefore, there is a 
draw region, the extent of which depends on the prespecified value of T*> As for 
the linear constraint case, this region is subdivided into two subregions. In the 
first, $uVv(^)> the draw is forced in the sense that the only alternative to draw for 
v is capture by u. In the second, , v has the unilateral choice between 

draw and joint 6-capture , and according to our assumption he chooses draw. 

To find the boundary of $uVv^)’ we integrate (21) and (22) using (65) and (67) 
backward from (e x , e 2 + 6) over an interval of time T*; the result is that (6) 

is defined by 

(x° - E 2 - 6) 2 > 2uT*(x° - £l ) + (x° - e 1 > 2 . (81) 

To find the boundary between $^^(6) and $^^(6), we integrate just as before, but 
start from terminal condition (e l + 6, e 2 ); the result is 

(x° - e 2 ) 2 > 2uT*(x° - Sl - 6) + (x° - e, - 6) 2 . (82) 
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The region defined by (82) and the converse of (81) is $uVv^)*' T ^ e re gi°n defined 
by the converse of (82) is 0 V (6), which is subdivided into regions $,£^'(6) and 
^(5) as before (see fig. 10(b)). 

Finally, for u < v (fig. 10fc)), $ v (6) is the entire playing space, again com- 
posed of regions $y^(S) and $^ 2 '(6). 


6. DISCUSSION 


Although the turret game is a very simple and idealized problem, analysis of 
this game has revealed a rich variety of combat phenomena. First, note that the solu- 
tion to this game exhibits many features commonly found in differential games, such 
as the existence of barriers and various types of "singular" surfaces. Two examples 
of singular surfaces are given by (52) and (58) with the inequality replaced by 
equality. We remark that the first of these is a singular surface of type (+, u, +), 
or a "universal” surface, in Isaacs 1 terminology (ref. 2), and the second is a surface 
of type (p, u, -). 

The optimal strategies also exhibit features common in differential game solu- 
tions. In most regions of the state/parameter space, the optimal controls of both 
players are unique and constant. There are regions, however, in which the controls 
are (1) two-stage constant ($^ 2 )(6) end $^ 3 )(6) in fig. 7(a)), (2) nonunique ($^ 2 ^((S) 
and $^ 3 )(S) in fig. 7(c) and $^'('6) in fig. 10), or (3) state-dependent (<f>^ 2 )(6) in 
fig. 10). We note that in many cases the optimal strategies are obvious; for example, 
from initial conditions in the region $^ 1 ^(6) in figures 7 and 10, player v can 
capture u, before being himself captured, by simply "standing" and turning his 
turret at the maximum rate. 

The turret game solution, however, shows that combat problems have many features 
not previously encountered in games of survival and pursuit-evasion. Perhaps the 
most interesting of these is the existence of a region (a surface in the limit as 
6 0) in which both players are locked into mutual destruction at the earliest possi- 

ble time ($u/v v ) in fig s * 7(a) and 10(a)), in the sense that any deviation from this 
policy by one of the players will result in his unilateral capture by the other. 

This situation has been found to occur also in nonoptimal air combat simulations 
(ref. 26). Another interesting new feature is a region in which one of the players 
has the unilateral choice between a draw and mutual destruction at a time of his 
choosing in figs. 7(b) and 10(b)). These new phenomena are of obvious 

importance to air-to-air engagements and other forms of combat. 

The idea of 6-combat, introduced to solve technical problems concerned with 
closure properties of the target sets, also has important practical implications. In 
the turret game, 6 is the closest the winning player is allowed to approach his 
opponent’s target at termination. Thus, the winning player can choose 6 to specify 
the degree of risk of his own capture he is willing to accept. 

It was remarked in section 3 that the event space in a combat game as defined 
here must be uniquely and unambiguously classifiable into the four regions $ u (5), 
$ v (6) , 4 > u Vv( 6) > and ^uAv(^) 5 an< ^ further that these regions are invariant in the 
sense that there exist strategies that keep the trajectory within the region in which 
it starts until termination. The solution of the turret game clearly has these 
features. Moreover, the subregions such as $^^(6) in figure 7(a) and $^■^(6) in 
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figures 7 and 10 are themselves invariant in this same sense. Therefore, the outcome 
and the optimal strategies of the combat depend solely on the problem data (including 
the initial state). 

Because of past emphasis on pursuit-evasion problems, it is of interest to exam- 
ine the turret game from a pursuit-evasion standpoint. First consider the (time- 
optimal) pursuit-evasion game with u the pursuer and v the evader (u/v) subject 
to (24) and (28). The necessary conditions give the optimal strategies as u * = u, 
v i “ Vp and v* = 0 for v-, < u, and capture occurs if v x < u. For v/u, the opti- 
mal controls are v* =0, v* = v 2 * and u* undefined, and capture always occurs. 

Now suppose we (naively) attempt to construct the combat results from the 
pursuit-evasion results by assuming that whichever pursuit-evasion game ends in the 
least time will be the one played. Then, for v x > u, the v/u game will be played 
everywhere. For v x < u, the times of the two games must be compared. Integrating 
(21) and (22) with the two sets of controls shows that the u/v game will be played 
if 



and conversely for the v/u game. These results are shown in the playing space in 
figure 11. Note that the slope of the boundary line (83) is greater than the slope 
of the boundary lines (47) and (48) . 

Comparing figures 8 and 11, we see that in regions (*) on figure 11 the two 
analyses give the same solutions, but that in the other regions the solutions are 
dramatically different. In all these other regions, the pursuit-evasion solution 
indicates that v will win, v T s optimal strategy is (0, v 2 ) , and u f s strategy is 
immaterial. In region (1), however, the combat game results show that if u plays 
u then u will win, and, moreover, he will win in minimum time if v persists in 
playing his pursuit-evasion-derived strategy. In (2) , the combat results show that 
the best v can achieve is a draw and that he must play (v x , 0) to do this; if he 
plays his pursuit-evasion strategy u will win. And in (3) , v can in fact win but 
he must recognize and avoid u’s target to do so. 

Thus, from v ! s standpoint, the pursuit-evasion results frequently tell him he 
can win when he cannot; moreover, use of the pursuit-evasion strategies frequently 
will cause v to be captured when capture is avoidable, or lead him to be captured 
in minimum time when capture cannot be avoided. From u’s standpoint, the pursuit- 
evasion results frequently tell him he will be captured and that his strategy selec- 
tion is of no consequence, when in fact he has winning or draw strategies. Thus, the 
serious fallacy of using pursuit-evasion methods to "solve" combat problems (i.e., 
differential games between opponents with offensive capabilities and offensive objec- 
tives) is clear. 

As a final point in our discussion, we wish to reemphasize the role of threat in 
optimal strategy selection in suitably formulated combat games. We have clearly seen 
above that both players combine offensive and defensive behavior in their optimal 
strategies. The winning player takes defensive measures to avoid being captured him- 
self during his offense. At the same time the losing player applies a threat to the 
winner, that is, implements his offensive capability, in order to prevent his oppo- 
nent from using the most damaging strategies (in terms of the formulated game’s 
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cost). The result is that in a properly formulated combat game, just as in actual 
combat, both players combine a suitable blend of offensive and defensive maneuvering. 



(a) y x < 1 . 


Figure 11.- Results for turret game based on minimum-time pursuit-evasion games. 
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